Exploring the assortativity-clustering space of a network's degree sequence 
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Nowadays there is a multitude of measures designed to capture different aspects of network structure. To be 
able to say if the structure of certain network is expected or not, one needs a reference model (null model). One 
frequently used null model is the ensemble of graphs with the same set of degrees as the original network. In 
this paper we argue that this ensemble can be more than just a null model — it also carries information about 
the original network and factors that affect its evolution. By mapping out this ensemble in the space of some 
low-level network structure — in our case those measured by the assortativity and clustering coefficients — one can 
for example study how close to the valid region of the parameter space the observed networks are. Such analysis 
suggests which quantities are actively optimized during the evolution of the network. We use four very different 
biological networks to exemplify our method. Among other things, we find that high clustering might be a force 
in the evolution of protein interaction networks. We also find that all four networks are conspicuously robust to 
both random errors and targeted attacks. 



PACS numbers: 89.75.Fb, 82.39.Rt, 89.75.Hc 

I. INTRODUCTION 

Network structure (Q]; HI; l24l) is usually defined as the way 
a network differs from what is expected. What "expected" 
means depends on the fundamental constraints on the net- 
work, and this can vary from system to system. For exam- 
ple, if the network is made of units that must be connected to 
two, and only two, others; then, it is not interesting whether 
or not a vertex lies on a cycle (we already know that it will). 
The ensemble of all networks fulfilling the fundamental con- 
straints on the system is usually called null model (or refer- 
ence model). When we have pinned down the null model 
we can measure the network structure by standard quantities. 
If the values of these quantities differs significantly from the 
null-model average, then we call the network structured. The 
baseline assumption of complex network theory is that net- 
work structure carries information about the forces that have 
formed the network. Ever since the studies of Barabasi and 
coworkers ^ HI), the degree distribution (or, if referring to 
the set of degrees of one particular network, degree sequence) 
has been regarded as the most fundamental network structure. 
For many networks, the degrees are related to outer factors 
(not emerging from the network evolution). In such cases the 
ensemble of all graphs with the same degree sequence as the 
original network is a natural null model. Another interpre- 
tation is that the network structures measured relative to this 
null model are of higher order than the degree — i.e., what re- 
main after the effects of the more fundamental structure (the 
degree sequence) is filtered away. The usual way to use a null 
model is to compare a network measure with the ensemble av- 
erage value of the null model. In this paper we will argue that 
one can glean more information about the original network by 
studying the null model ensemble in greater detail than just 
measuring averages. 



We consider networks that can be modeled as a graph 
G - (V, E) where V is the set of vertices and E is the set of 
M undirected edges. We denote the ensemble of graphs with 
the same degree sequence as G as Q{G). Our basic approach 
to study ff(G) is to resolve its members in the space of higher 
order network structures. The two such higher order network 
structures we consider in this paper are: the correlation be- 
tween the degrees at either side of an edge (measured by the 
assortative mixing coefficient, r (23), or simply assortativity); 
and, the fraction of triangles in the network (measured by the 
clustering coefficient, C fiH^))- By mapping out Q{G) in the 
space defined by r and C one can pose questions such as: How 
large is the region in r-C space where members of @{G) actu- 
ally exist? (This helps us answer how constrained the network 
evolution is if the degrees are given.) Is the real network close 
to ^(G)'s boundaries in r-C space? (Which would indicate 
whether or not r or C are actively optimized.) 

The basis for our exploration of an ensemble QiG) is to 
map out its members in the space defined by some network- 
structural measures, in our case the assortativity and cluster- 
ing. We explore the r-C space by successively rewire pairs of 
edges, (/, i) and (/', /) to (;, /) and (;', J), that takes the system 
in a desired direction. Rewiring techniques for studying net- 
works are half a century old d 1 Ol) (ra ndomization for obtaining 
null models was studied in Ref. (fla)). In the phy sics literature 
these techniques were first used in Refs. 



II. NETWORK STRUCTURAL MEASURES 

Before going into details of our algorithm, we will review 
the network structural quantities that we use to describe our 
networks: both the independent variables (the assortative and 
clustering coefficients) that form the basis for our space of in- 
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terest; and the quantities we use for characterizing the regions 
of this space. 



A. Assortative mixing coefficient 

It is quite well accepted that the set of degrees, the degree 
sequence, is the network quantity that contains most infor- 
mation about both the evolution and function of the network. 
Degree can (in most contexts) be identified as how influential 
the vertex is (Isol) (in some sense) — high degree vertices are 
assumed to be more influential both the formation of the net- 
work and the flow of dynamic systems on the network. In this 
paper we assume the degree sequence is inherent to the system 
and look at higher order structures arising from how the ver- 
tices are linked to one another The simplest such higher-order 
structure is the correlations between the degrees of vertices at 
either side of an edge. Is it the case that high-degree vertices 
are primarily connected to other high degree vertices, or are 
they linked to low-degree vertices? A simple way of mea- 
suring this tendency is by the assortativity |24j) r. Basically 
speaking, r is the linear correlation coefficient of the degrees 
at either side of an edge. One complication is that since the 
edges are undirected, r has to be symmetric with respect to 
edge-reversal, but the correlation coefficient is not symmetric. 
The solution is to let one edge contribute twice to the covari- 
ance, i.e. represent an undirected edge by two directed edges 
pointing in opposite directions. If one use an edge list repre- 
sentation internally (i.e., let the edges be stored in an array of 
ordered pairs (iujx), ■■■ , (/m, Jm)) then (23) 

A{hk2)-{h+k2f 

r — ( 1 ) 

2{k\ + kl)-{k,+k2)^ 

where, for an edge (/, j), k\ is the degree of first argument 
(i.e., the degree of and k2 is the degree of the second argu- 
ment. The range of r is [-1, 1] where negative values indi- 
cate a preference for high connected vertices to attach to low- 
degree vertices, and positive values means that vertices tend 
to be attached to others with degrees of similar magnitudes. 

B. Clustering coefficient 

Several simple random network models (such as the Edros- 
Renyi (9) or the model for generating networks of a given r- 
value in Ref. Q) have rather few triangles (fully connected 
subgraphs of three vertices). For some classes of real-world 
networks (notably social networks (12)) there is a strong ten- 
dency for triangles to form, which makes such models fail. 
The network measure of the density of triangles is called clus- 
tering coefficient. We use the definition of Ref. (i): 

C = 3ntriangle / ^triple; (2) 

where nmangie is the number of triangles and Wtripie is the num- 
ber of connected triples (subgraphs consisting of three vertices 
and two or three edges). The factor three is included to nor- 
malize the quantity to the interval [0, 1]. 



C. Distance and component size 

Two quantities that are, perhaps more than any other, re- 
lated to the functionality of dynamic processes on the network 
are the relative size of the largest component (connected sub- 
graph) s, and the average distance (d). s is simply defined as 
the number of vertices in the largest component divided by A^. 
The distance d{i, j) between two vertices / and j is defined as 
the number of edges in the shortest path between these two 
vertices, {d) is d{i, j) averaged over all vertex pairs (/ j) in 
the largest component. In a network with large s and small 
(d), spreading processes will be fast and far-reaching. This is 
a good property of information networks, but bad in the con- 
text of, for example, disease spreading. Some authors have 
combined the distance and component size aspects by con- 
sidering the average reciprocal distances ( fT4t [itI) . For most 
purposes, we beUeve, valuable information gets lost in such a 
combination (a fragmented network G with short average dis- 
tances can be something very diff'erent from a connected graph 
of large distances and the same average reciprocal distances as 
G). 



D. Robustness 

One line of complex network research is the study of the 
response of the network to attacks, errors, failures and other 
events that efifectively change the structure. The error re- 
sponse problem is usually formulated as: how does the func- 
tionality of the network change if a random fraction of the 
vertices, or edges, is removed (1241) ? The attack problem is the 
same, except that the vertices are not selected randomly but 
according to some strategy intended to decrease the networks' 
functionality as rapidly as possible (BfTil). A frequently used 
metric for functionality is the ratio of s before and after the 
event (2; 14; 22). In the error and attack robustness problems, 
this quantity is typically plotted as a function of the number 
of removed vertices. The idea is that even if one network G 
is more robust than another network G' to the removal of, 
say, 1% of the vertices, G' can be less vulnerable than G if 
10% of the vertices are deleted. Since we aim at mapping out 
the r-C space of degree sequences, we would like to capture 
the robustness with just one number. We will use what we 
call the f -robustness Rf of a network as the expected frac- 
tion of vertices that needs to be removed for the relative size 
of the largest component to decrease to a fraction / E (0, 1) 
of its original value. The way of removal can either be ran- 
dom (the error problem) or selective (the attack problem). For 
the rest of the paper we will set / = 1 /2, and refer to the 1/2- 
robustness just as "robustness" 7?. Other /-values give slightly 
different results, but our conclusions will hold for a range of 
intermediate /-values. 



III. THE ANALYSIS SCHEME 

The fundamental idea of our method is simple: we update 
the network by choosing pairs of edges randomly, say (/, j) 
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FIG. 1 Illustration of the analysis scheme applied to the C. elegans 
neural network, (a) shows how the valid region is mapped out: 1. 
Jmin is located. 2. r^ax is found and the interval [r„,jn, r^ax] is divided 
into L segments. 3. Cmi„(«) is constructed. 4. Cmax('j) is traced and 
the interval [Cmin, C„,ax] is segmented into L regions, (b) illustrates 
the sampling of the pixels. The next pixel to go to is chosen from a 
random permutation of the pixels. In this example n and n' are cho- 
sen to be far apart. The line shows the path taken by the algorithm. 
The circles indicate every thousandth step on the way from n to n' . 
The blow-up illustrates the random walk within a pixel to sample the 
graphs of the pixel more randomly. 



and (/',/), and swap one end of them (forming (/, /) and 
{i' , ]))■ This guarantees that the degree sequence stays intact. 
We navigate in the r-C space by only accepting changes that 
move us in the desired direction. If an edge-swap would intro- 
duce a self-edge (i.e. if / - f or /' = j) or a multiple edge (i.e. 
if (/, /) or (/', j) belongs to E before the swapping, or move) it 
is not performed. There are many other technicalities concern- 
ing the convergence to extremes, uniformity of the sampling 
and more that we discuss in the Appendix. 

The members of the ensemble Q{G) do not, in general, 
cover the whole range of (r, C)-values. Indeed, for any finite 
G, QiG) defines a set of points, rather than a continuous re- 
gion, in the r-C space. We will perform a more coarse-grained 
analysis breaking down the r-C space into pixels and average 
quantities over the graphs of QiG) with (r, C)-values within 
the pixel. (Thus, a pixel constitute a graph ensemble in itself, 
our aim is to sample its members with uniform randomness.) 



For a computationally tractable resolution, the pixels contain- 
ing members of Q{G) typically form contiguous regions. We 
will refer to the pixels that contain a member of Q{G) as valid 
pixels, and all pixels that are valid or between valid pixels the 
valid region of Q(G). 

To trace the valid region of @{G) we start by finding the 
lowest and highest assortativity value, rmin and r,nax respec- 
tively. Briefly speaking (more details follow below), to find 
'"min we rewire edge-pairs that lower r (and vice versa for 
'"max)- After finding the extremal r-values, we splice the re- 
gion between these into L segments. Then we go through 
the region and for each region n e [1,L] we find the mini- 
mal and maximal C-values, C,nin(n) and Cmax(n)- The region 
in C-space between the lowest Cmin = inini<„<z, Cmin(n) and 
highest Cmax = maxi<„<£ Cn,ax(n) observed clustering coeffi- 
cient is segmented into L regions. (Note that Cmin, without 
argument, is the global clustering minimum, whereas Cmin(n) 
is the minimum conditioned on r being in the «'th segment.) 
Thus we (assuming our method works) obtain an L x L grid 
of the r-C space that contains the vaUd region of 0(G). The 
method is illustrated in Fig.[T] 

To find the 0{G) elements of minimal and maximal assor- 
tativity is a non-trivial optimization problem. There are de- 
terministic methods that, if they terminate, are guaranteed to 
give the maximal (or minimal) assortativity ( 18; 32). To avoid 
the such technicalities and to simplify the program, we will 
use the same kind of optimization algorithm to find rmax and 
rmin as to find Cinin(n) and Cmax(n)- In the Appendix we will 
argue that this method allows us to come as close to the op- 
timal r-values as we need. A method we find efficient is to 
repeat the simple edge-pair swapping procedure (where only 
changes in the desired direction are accepted) with different 
random seeds until no lower state is found during a number 
Viep. of repetitions (29). Each individual edge-pair is termi- 
nated when no lowest state is found for Vsame swaps. In gen- 
eral, the larger the network is, the more densely distributed 
are the points close to the border of the valid region. If one 
is satisfied with finding a value a certain distance from the 
extrema, then Vrep. and Vsame do not need to be increased for 
larger A^. To find Cmin(n) and C,Tiax(n) almost the same pro- 
cedure is employed. First, edge-pairs are swapped until the 
desired segment of r is found. Second, unless r is outside the 
segment n and the move takes the system yet further from seg- 
ment, edge-pairs are swapped provided the clustering would 
decrease (for Cinin(n)), or increase, (for Cmax(n))- When the 
valid region is traced out and we sample networks of different 
pixels, we select the pixels randomly. The idea is to sample 
the space of networks more randomly. 

To summarize, the algorithm for finding the extremal assor- 
tativity values, Tmin and r^ax, is: 

1. Choose two undirected edges (i,j) and (/',/) at ran- 
dom. If the program makes a difference between the 
arguments of the edge, the direction of the reading of 
the edge also has to be randomized (so (/, j) is read as 
(j, i) with probability 1/2). 

2. Check if swapping these edges to (/, /) and (/', j) would 
introduce a self-edge or multiple edge in the network. If 
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so, go to step[T] 

3. Let Ar be the change in r if the move in step [1] is exe- 
cuted. If r is to be minimized and Ar < 0, then accept 
the change (vice versa for maximization of r). 

4. If no move has been executed during the last Vsame ex- 
ecutions of step |3] then take the current r as r^in (or 

^max)- 

5. Repeat from the beginning Vrep. times and return the 
lowest observed rmin during these iterations. 

Given r^in and r^ax, and a division of the r space into L 
segments of width (r^ax - rmiid/L, we trace the boundaries of 
the valid region as follows: 

6. Go through the regions sequentially. Say then' th region 
is the interval [r„, r„+i). 

7. Perform step [T] and |2] of the assortativity optimization 
algorithm. 

8. Let AC be the change in clustering coefficient during 
the previous step. If r < r„ and Ar > 0, r > r„+\ and 
Ar < or r„ < r < r„+\ and AC < (for minimization) 
or AC > (for maximization), then perform the change 
of step|6l 

9. If, counting from the first time the system entered the 
desired r-segment, the minimal (maximal) C-value has 
been repeated Vsame times, take this value as Cmin(n) 

(Cmax(n))- 

10. Repeat from step|6]viep. times. Let the lowest Cmin(n)- 
values and largest Cn,ax(n) during these iterations be 
Cinin(n) and Cmax(n)- 

Then, when the valid region is mapped out, we split the C- 
range (between C^in and Cmax in L segments of equal width, 
thus forming an L x L-grid enclosing the valid region. This 
grid is sampled as follows: 

1 1 . Construct a random permutation of the valid pixels. 

12. Pick the next pixel P„ = {r„,r„+\) x [C,„,C,„+i) from 
the index-list of step [TT] Denote the center [(r„ -H 
r„+i)/2, (C„, + C„,+i)/2)] of the pixel (r„,o, C^.o). Let 

Mr, C) = i^^^Ui^^^^ (3) 

\ \ ^max ^min / \^max ^min/ 

measure the distance in r-C space from the current po- 
sition (r, C) to the center of the target pixel. 

13. Pick edge-pair candidates according to steps [T]and|2]of 
the assortativity optimization algorithm. 

14. Calculate A(r,C) = 6{r\C') - 6(r,C) where r and C 
are the current assortativity and clustering values, and 
r' and C are the values after the pending move has been 
performed. If A(r, C) < perform the move. 



TABLE I Basic statistical properties of the example networks we 
use. The number of vertices A', number of edges M, assortativity r, 
clustering coefficient C, relative size of the largest cluster s, average 
distance in the largest cluster (d), the error robustness terror and the 
attack robustness /^attack- 
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neural 
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291 


4168 


1905 


280 
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278 


7434 


3526 


1973 


r 


-0.36 


-0.13 


-0.10 


-0.069 


C 


0.0016 


0.034 


0.039 


0.20 


s 


0.38 


0.94 


0.87 


1 


id) 


4.2 


4.8 


4.5 


2.6 


terror 


0.43 


0.36 


0.36 


0.50 


^attack 


0.012 


0.048 


0.046 


0.38 



15. If the updated (r, C) belongs to P„, then: First, make 
Vind. random edge swappings such that (r, C) does not 
leave P„. (This is to sample the pixel more uniformly.) 
Then, measure network structural quantities of P„, save 
these values for statistics, and go to step[T2l 

16. If not all pixels have been measured go to step[T2l 

17. Go to step[TT]until each pixel have been sampled Vsamp. 
times. 

The parameter values we use in this study are (unless other- 
wise stated): Vsame =10^, Vrep. = 5, Vsamp. = 100, Vrnd. = 1000 

and L - 50. The choice of parameters and further consid- 
erations are discussed in the Appendix. Due to the uncertain 
stopping conditions of steps ID |5j H] and [10] it is hard to de- 
rive meaningful bounds on the computational complexity. We 
note, however, that the optimization is faster in r- than in C- 
direction, this probably relates to the observation in Fig. [Tib) 
that swapping procedure moves faster in the r- than in the C- 
direction. (The speed in the C-direction is roughly the same 
per 1000 steps, but the speed in the r-direction decrease.) 

IV. NETWORKS 

Our method can be applied to every kind of system that 
can be modeled as an undirected network. To limit ourselves, 
we use four networks from biology as examples in this paper 
These networks are, nonetheless, representing fundamentally 
different systems. 

A. Gene fusion network 

Cancer is a disease that occurs due to changes in the 
genome. One important process causing such changes is gene 
fusion — when two genes merge to form a hybrid gene (21). 
In Ref. (fill) the authors construct a network of human genes 
that have been observed to be fused in the development of 
tumors in humans. Some genes can fuse with many others 
but most of the genes have only been observed fusing with 
one, or a few others. The resulting network structure has 
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a skewed, power-law like degree distribution and is rather 
fragmented — the largest component spanning only 38% of the 
vertices. Statistics of this and the other networks are listed in 
Tablel] 



B. Metabolic network 

A cell can be regarded as a machine driven by biochemi- 
cal reactions. The possible reactions of the metabolism (the 
cellular biochemistry except signaling processes) and its en- 
vironment determine the state of the cell. The metabolism 
of an organism is a very complex system — so complex that 
one has to choose between studying a part of it in detail, or 
the whole with a coarser method. One approach in the latter 
category is to construct a network, connecting the chemical 
substrates occurring in the same reactions to a network, and 
employ network analysis to characterize the large-scale struc- 
ture of the metabolism. The way to construct a biochemical 
network is not entirely straightforward (HH). Should the sub- 
stances be linked to each other (in a substrate graph), or to 
the reactions they participate in? If one use a substrate graph, 
should the substrates be linked only to products, or to all re- 
actants (i.e. in a reaction A H- B C H- D, should A be linked 
to C and D, or to all three other vertices)? Furthermore, some 
chemical substances (like H2O, ATP, NADH, and so on) are 
abundant throughout the cell and seldom pose any restriction 
on the reaction dynamics. For many purposes, one obtains a 
more meaningful network by deleting such currency metabo- 
lites. The biochemical network we use is the human metabolic 
network of Ref. JTsI) . In this network, substrates are linked 
only to products (A to C and D in the above example). Cur- 
rency metabolites are identified and deleted according to a 
self-consistent, graph-theoretic method dTsI) . 



C. Protein interaction 

In protein interaction networks the vertices are proteins and 
two proteins constitute an edge if they can interact physically. 
Examples of interaction are the ability to form complexes, car- 
rying another protein across a membrane or modifying an- 
other protein. We use the ("physical interaction") data set 
from Ref. (13) of protein interaction in the budding yeast S. 
cerevisiae. 



D. Neural network 

For the biochemistry of an organism, the network repre- 
sentation is a crude model of the system as a whole (as an 
alternative to a detailed model of a subsystem). Neural net- 
works are yet more complex. For these the choices are either 
to make a coarse-grained network representation ( 27) or study 
the full network of a very simple organism. In this work, we 
take the latter approach and study the neural network of C. 
elegans jSlI) . In this data set, the strength of the neuronal cou- 
pling has been measured, but we make the network undirected 



by letting an edge represent a non-zero coupling. 



V. NUMERICAL RESULTS 

In this section we present numerical results for our four 
network-structural measures over the Q{G) ensembles of the 
four test graphs. To get a first view, we display the valid re- 
gion of the gene fusion graph in Fig.|2|a). As seen, the valid 
region is not covering a large part of the theoretical limits of 
r (-1 < r < 1) and C (0 < C < 1). (Note that only fully 
connected graphs have C = 1, and for these r is undefined.) 
The requirement that the graph should be simple (no multi- 
ple edges or self-edges) puts hard constraints on the actual 
r-values that can occur (cf. Ref. dlcj)). Fig. |3a) shows that, 
considering the entire r-C plane, such constraints are even 
harder The general shape of the valid region is consistent 
with the observations that the simple-graph constraint induce 
a positive correlation between r and C (i2Cki25"l) . 

In Fig. |2b), (c) and (d) we show three example networks 
of @(G) (where G is the gene fusion network). Fig.|2lb) dis- 
plays the relatively fragmented real network. Fig. |2|c) is a 
random network G' with the almost the same r-C coordinates 
as the real network {6{G, G') ^ 0.0026). Maybe the biggest 
visible difference between G and G' is the larger size of the 
largest component of G'. Is it true that the gene fusion net- 
work is unusually fragmented, given the degree sequence and 
r-C coordinates? If so, there might be an evolutionary pres- 
sure for gene fusion networks to be fragmented. (This will 
be discussed further in Sect. IV. Al ) Fig.|2ld) shows, as a con- 
trast, a network far away from G and G'. The network has a 
well-defined core where high-degree vertices connect to each 
other There are also a number of peripheral triangles, which 
indicates that the network evolves toward a maximal C-value, 
given its assortativity. 



A. Location in r-C space and size of largest component 

In Fig. [3] we plot the relative size of the largest compo- 
nent of the four test networks. We also display the loca- 
tions of the actual networks in the r-C plane, and the Q(G)- 
averages. (The @(G) averages are obtained from a rewiring 
sampling of QiG), with step|2]of the algorithm as the only con- 
straint.) We see that the C-value of the gene fusion graph lies 
close to the Cinin(r)-boundary of its valid region. C averaged 
over the whole QiG) is about three times larger i{C)g(c) - 
0.0061 + 0.0001) than the observed value (C = 0.0017). Fur- 
thermore, we see that the assortativity is lower than the 0{G) 
average. This kind of analysis has been used by many au- 
thors (following Ref. (fl9l) ). The interpretation is usually that 
the network is, effectively, disassortative and clustered (i.e., 
r < {r)g(c) and C > {C)g{c))- However, looking at the entire 
valid region, we can get another perspective: If high cluster- 
ing really would have been an important goal for the network 
to obtain (given the degree sequence) there is large room for 
improvement. For the assortativity, on the other hand, the ob- 
served network is rather close to the minimum. This might 
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FIG. 2 The valid region demarcated by the Cmi„(r)- and Cmax('')-curves (a), and three networks: (b) is the original gene fusion network, (c) 
shows a random sample with r-C coordinates close to those of the real network, (d) shows a network with high clustering and high assortativity. 
The largest component of (b), (c) and (d) are indicated with a different color. 
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FIG. 3 The relative size of the largest component s as a function of r and C. The networks are (a) the network of gene fusions in tumors 
in humans, (b) protein interaction network of 5. cerevisiae, (c) human metabolic network and (d) the C. elegans neural network. The x-like 
symbols of the main figures and the diamond symbols of the color-bars indicate the values of the real networks. The plus-like symbol indicates 
the average (r, C)- value of the Q(G) ensemble. 
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be telling us that assortativity is a more important factor, than 
clustering, in the evolution of the gene fusion networks. The 
protein interaction network of Fig. Ob) is located quite far 
from the ensemble average — the assortativity is much lower 
than the ^(G)-average, and given that assortativity, the clus- 
tering is maximal. Also the metabolic (Fig. He)) and neural 
(Fig- Eld)) networks are more clustered than the average, but 
here the assortativity is slightly larger than the ff(G) average. 
From Fig. [3] we also note that the density of states is very in- 
homogeneous distributed — the average (r, C) is close to C = 
and (except for the neural network) left of the middle of the 
assortativity spectrum. The shapes of the valid regions are 
rather similar, with an exception for the broader region of the 
neural network. This can be related to the more narrow degree 
sequence of the neural network ( 3). We have established a cor- 
relation between r and C. Ref. (,25i) argues that such correla- 
tion occurs in social networks because of their modularity (or 
"community structure" as the authors call it). However, our 
large-r networks have no explicit bias towards high modular- 
ity, which leads us to conjecture that the correlation between 
r and C, or more fundamentally the sum Yj(i,j)eE ^i^j (which, 
given a degree sequence, is the only factor of Eq. [T] that can 
vary) is a more general phenomenon. Since r is normalized 
by, essentially, the variance of the degree, it follows that the 
valid region for 0{G) with more narrow degree sequence will 
appear stretched (larger). 

Turning to the average size of the largest component, 
we observe that the gene fusion network is indeed more 
fragmented than the average network of the same (r, C)- 
coordinates (as anticipated from comparing Figs. W[h) and 
(c)). The protein interaction and neural networks have no par- 
ticular bias in this respect, whereas the metabolic network is 
more fragmented than expected. The relatively low s of the 
metabolic network can be attributed to the "modularity" of 
such networks (fisl : [33I) . Such modules are subgraphs that 
are densely connected within, and sparsely inter-connected. 
Sometimes they are even disconnected from the largest com- 
ponent (which explains the lower s). In general, s decreases 
with assortativity. This is natural — in more assortative net- 
works high degree vertices are connected to each other, form- 
ing a highly connected core and a periphery too sparse to be 
connected (viz. Fig. |2c) and (d)). For the denser networks 
(the protein interaction, metabolic and neural networks) s in- 
creases with C (for a fixed r). For the sparser gene-fusion 
network s has a peak at intermediate C. We do not speculate 
further about combinatorial cause of these dependencies; but 
we note (comparing e.g. Figs. |2a) and (b)) that even though 
the shape of the valid regions are similar, the s behavior can 
be qualitatively different. 



B. Distances in the largest component 

In Fig.|4]we display the average distance in the largest com- 
ponent. As mentioned, measuring the distance can give com- 
plementary information to the s{r, C) graphs of Fig. [3] — while 
s tells us how much of the network that can be reached, {d} 
tells us how fast that can happen. For all networks the big 



picture is that large connected components have large average 
distances. This is expected from most network models. There 
is, however, more information than this in Fig. HI for compo- 
nents of the same size, the average distance is increasing with 
both r and C. That (d) should increase with C seems quite 
natural — if one of a triangle's edges is rewired to connect two 
distant vertices, the distances in the surrounding of the tri- 
angle would increase with one, but this would be more than 
compensated by the connection of the two previously distant 
areas. Disassortative networks typically lack a well-defined 
core. Such cores are known to keep the average distance of 
general power-law networks short (7). Thus one would expect 
an increase of r to cause a larger (d), but apparently the clus- 
tering related length-increase outweighs this effect. In con- 
trast to the relative size, the average distances of the real net- 
works are close to the ^(G)-averages at the same r-C coor- 
dinates. For the gene fusion network (with a relatively small 
largest component), this means the distances are rather large. 



C. Error robustness 

Next, we turn to the error robustness problem. As seen in 
Fig.|5]the gene fusion network (Fig.|3a)), once again, has a 
qualitatively different behavior than the other three networks 
(Fig-Elb), (c) and (d)). While the gene fusion network is most 
robust for high r- and C-values the other networks are most 
robust for low r. A sketchy explanation can be found in the 
chain-like subgraphs extending from the largest component in 
a large-r network (cf. Fig. |2]i — with a random deletion of ver- 
tices, these subgraphs are likely to be disconnected from the 
core rather soon (whereas in a disassortative network alterna- 
tive paths may still exist), then if the deletion-robust core is 
less than half of the original component size it follows that 
it may soon be isolated. The sparsity of the gene fusion net- 
work makes the low-r g'(G)-graphs much like trees (i.e., hav- 
ing few cycles), and since cycles provide redundant paths that 
can make a network robust, it follows that these graphs are 
fragile. For a fixed r, /?eiTor is a decreasing function of C for 
the three largest networks. We believe this is an effect of the 
local path redundancy induced by triangles — if one vertex of 
a triangle is deleted, the other two are still connected. 

The /?enor-values for the real networks are always markedly 
higher than the ^(G)-averages for the same (r, C)-coordinates. 
Networks with highly skewed degree distributions (the gene 
fusion, protein interaction and metabolic networks) are known 
to be robust to errors by virtue of degree distribution alone 
now Fig. |5] tells us that all these networks have a yet higher 
error tolerance which is an indication that error robustness is 
an important factor in the evolution of these networks. 



D. Attacl< robustness 

The final quantity we measure is the attack robustness (see 
Fig.|6]l. ^attack's functional dependence on r and C is quite dif- 
ferent from that of /?enoi - The gene fusion 0(G) has the high- 
est attack robustness at high r- and low C-values. The other 
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FIG. 4 The average distance within the largest component {d) as a function of r and C. The panes and symbols correspond to those of Fig. [3] 
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FIG. 5 The error robustness i?eiTor as a function of r and C. The panes and symbols correspond to those of Fig. [3] 



networks have higher robustness values for high assortativity, 
but no clear tendency in the C-direction. The attack mech- 
anism we study targets the high degree vertices. Having all 
high degree vertices connected to each other is probably the 
only way to keep the network from instantaneous fragmenta- 



tion. The observed r-dependence is thus rather expected. The 
real-world networks all have /^attack-values of the same order 
of magnitude as the average values for the QiG) networks of 
the same location in r-C space. We note that for studying 
the attack problem of metabolic networks, the (less common) 
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FIG. 6 The error robustness i?anack as a function of r and C. The panes and symbols correspond to those of Fig.|3] 



TABLE II Summary of the network structural measures of the real 
world networks relative to the average values of the Q(G) a distance 
6 < 0.02 from the real network. "<" indicates that the real network 
have a lower value than the corresponding ^(G)-value. All results 
are significant with p-values > 0.01, except the i-value of the neural 
network that has a p-value of ~ 0.05. 
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enzyme centric graph representation is more appropriate (see 
Sect. IIV.BI) . The reason being that one can suppress an en- 
zyme much easier than removing the substrates. 



E. Comparison between the graphs 

Even though all our example networks are constructed 
from biological data, they represent fundamentally different 
systems — the neural network is spatial by nature, the protein 
interaction and (even more so) the metabolic networks are the 
background topology for an active dynamic system, whereas 
the gene fusion network is a representation of possible but 
undesired events. The protein interaction, metabolic and neu- 
ral networks have one thing in common — the organism needs 
them to be robust to errors (caused by injuries, mutations, dis- 
ease etc.) (28*). As mentioned above and summarized in Ta- 



ble [n] the error robustness is indeed higher for the real net- 
works than the ^(G)-ensemble at the same (r, C)-coordinates. 
As mentioned above, the attack robustness of the real network 
is of the same order as the ^(G)-average at the same (r, C)- 
coordinate, but actually there is a significant tendency that 
these network also are more robust to attacks. Furthermore, 
the distances in the largest component, and the relative sizes 
s are (with the neural network i-value as the only exception) 
smaller in the real than the 0(G) networks. 

Despite these similarities between the statistics of the real- 
world networks the r-C space of the different degree se- 
quences have qualitatively different network structure. Espe- 
cially, the gene fusion network behaves almost the opposite of 
the other networks (at least for s, {d) and /?enoi )■ The source of 
this opposite behavior (as we discuss above) is probably that 
it is much sparser than the other networks. The neural net- 
work is the densest network and the only one that do not have 
a power-law like degree distribution. 



VI. DISCUSSION 

Many complex network studies use the ensemble @{G) of 
graphs with the same degree sequence as the subject graph G 
as a null model. In contrast to a generative network model, 
with a few degrees of freedom that has to be fitted approxi- 
mately, such an ensemble has 0{N) degrees of freedom that 
can be matched exactly with the values of G. We argue that 
0(G) is more than a null model — ^by resolving the graphs of 
0(G) in a space defined by some network-structural measures, 
one can get a picture of the opportunities and limits there are 
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(or has been) in the evolution of G. In this work we map out 
ff{G) in the two-dimensional space defined by the clustering 
coefllcient and the assortativity. Then we measure other net- 
work structural quantities throughout this space. One formal 
way to see our method is that we resolve Q{G) in the (high 
dimensional) space of all sensible network measures. Then, 
for simplicity, we project to a few dimensions. (The case of 
projection to one dimension has been studied in a less formal- 
ized way earlier — ^projection to assortativity (f32i) or a "hier- 
archy" measure (4).) An interesting open question is to find 
the principal components of the space of all sensible network 
measures. Using four example networks from biology, we 
measure average values of four network-structural quantities 
over the r-C space and compare these with the values of the 
real networks. 



The functional characteristics of the r-C spaces varies much 
between the four example networks. For example, the C. el- 
egans neural network covers a much larger area of the r-C 
space, something that probably relates to its more narrow de- 
gree distribution. The human gene fusion network, on the 
other hand, has a broad degree distribution similar to the S. 
cerevisiae protein interaction and human metabolic networks, 
still the structural dependency on r and C is very different for 
the gene fusion network compared to the others. We argue that 
this difference stems from the sparseness of the gene fusion 
network. To achieve a comprehensive understanding about 
how the network structure throughout the r-C space depends 
on the degree sequence, one would need a systematic inves- 
tigation of different artificial degree sequences. In this paper, 
we do not pursue this goal beyond the analysis of the four bio- 
logical data sets. The position of the real networks in the valid 
region of the r-C space adds some further information. For ex- 
ample, it may have been the case that networks with lower as- 
sortativity have been favored during the evolution of the gene 
fusion network. Clustering, on the other hand, has probably 
not put any constraint on the network evolution. Furthermore 
we compare the network structure of the real networks with 
the average values of networks in 0(G) that are close to the 
(r, C)-coordinates of the real network. From this analysis, we 
conclude that all our four example networks are more robust 
to both random errors and targeted attacks than what can be 
expected from a random network constrained to the same de- 
gree distribution, assortativity and clustering coefficient. For 
all networks, except maybe the gene fusion network, this is in 
line with robustness being an important factor in the network 
evolution. Note that in this work we assume the subject net- 
work to be accurate. To get more valid error estimates one 
would need to take the accuracy of the edges into account. 



The analysis scheme presented in this paper can be further 
extended and analyzed. As mentioned, it would be interest- 
ing with a quantitative evaluation of the network-structural 
spaces, and how they depend on the degree sequence. One 
can also try, for time-resolved data sets, to incorporate dy- 
namic information in the analysis by monitoring the network- 
evolutionary trajectory in the r-C space. 
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FIG. 7 Convergence of the optimization algorithm, (a) shows the av- 
erage maximal assortativity (r^ax) with Vrep. = 1. The horizontal line 
represents the result of the maximization algorithm of Ref. JTsI) . (b) 
shows the further improvement by finding the maximum over many 
independent runs (for Vsamc = 10000). The vertical bars indicates the 
standard deviation of the observed maxima. 
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APPENDIX A: Convergence and sampling uniformity 

In this Appendix, we address some technical issues of our 
method related to the convergence of our optimization algo- 
rithm and uniformity of the sampling. We will also motivate 
our choice of parameters. 



1. Assortativity and clustering extremes 

To find the extremal assortativity values we use the edge- 
swapping algorithm described in Sect.|lll] To find rmin we start 
from a random member of Q{G) and swap random edge-pairs 
(keeping the graph simple at all times) that lower r. When no 
graph of lower r has been found for Vsame time steps, we break 
the iteration. To avoid the effect of being trapped in local min- 
ima, this process is repeated Vrep. times. The main motivation 
for using this method is that it is at heart the same scheme as 
for obtaining the extremal clustering values and sampling the 
valid region (and thus we can re-use the same code for many 
steps of the calculations). In this section, we argue that the 
optimization performance of this method is sufficiently good 
for our purpose. 

There is a deterministic method to maximize the assorta- 
tivity that is, if it exits properly, guaranteed to find rmax 
The method works as follows: First all vertex-pairs (/, j) are 
ranked in decreasing order of the product of their degrees, kikj. 
Then the edges are added in order of this list unless the degree 
of one of the vertices already is fulfilled. There are some other 
technicalities from the additional constraint (of the authors) 
that the network should be connected. Of our networks, only 
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the neural network has such an evolutionary constraint, so we 
do not impose it. 

In Fig.|2]we display the parameter dependence of the con- 
vergence for the gene fusion network. The horizontal line 
is the theoretical maximum obtained by the algorithm of 
Ref. in. When Vsame — 10000 we obtain an average max- 
imal assortativity within 0.001 of the theoretical maximum 
(Fig- Ha)). By increasing Vjep. the accuracy can be increased 
further (Fig. Hb)). The lattice spacing we use is 0.005 < r < 
0.02, so we deem a precision of 0.001 sufficient. The gene 
fusion network is our smallest network but the other networks 
are not harder to converge. When one edge-pair is swapped so 
that r decreases, the only term of Eq.[T]that changes is (ki ^2)- 
The potential change of the sum Yj{ij)eE ^i^ji in the calcula- 
tion of {k\ ki) (close to the extrema) is of the order of the typ- 
ical degree values of the network. These values grow slower 
than the network itself, which means that a larger network can 
be closer in r, but further away in number of edge swaps to 
reach the global optimum, than a smaller network. Some au- 
thors use kikj to measure the degree correlations, 
but since we strive for a macroscopic level of description (con- 
sistent in the large-A^ limit), r is a more appropriate quantity 
for the present work. 

The optimization of the clustering to find the minima 
(maxima) of the segments of assortativity space follows the 
same pattern as the method to find the minimal (maximal) r. 
Changes of the parameters (Vsame and v,ep.) have the same ef- 
fect as in Fig. Ill and the same values seem sufficient. 



2. Sampling uniformity 

The other technical issue we address in this Appendix is 
ffie uniformity of our sampling procedure. Ideally we would 
like all unique (i.e., non-isomorphic) members of @(G) to be 
sampled with the same probability. The most important ob- 
servation is trivial — ^by edge-pair swapping one can go from 
one member of Q{G) to any other, and thus all members of the 
ensemble will contribute to the averages. A much harder ques- 
tion is whether or not every member of 0{G) is sampled with 
uniform probability. In this section, we will argue that our al- 
gorithm does a reasonably good job in the sense that there are 
no inconsistencies and parameter values are appropriate. 

When the target pixel is found (step [15] of the algorithm) 
we perform Vmd. additional random edge-pair swaps. The idea 
is to sample the ^(G)-members of the pixel more uniformly 
(and indeed to be able to reach into the interior of the pixel). 
In Fig. [8ja) we illustrate the effect of these random moves. 
We plot a normalized histogram of the relative largest clus- 
ter size s for 0, 100 and 10000 random moves. We see that 
ffiese moves do make a difference (the Vmd. = is different 
from the Vmd. = 100) but it does not matter if Vmd. = 100 or 
Vrnd. = 10000. The same situation is observed for other pixels, 
networks and quantities. Therefore, we use Vmd. - 1000 in 
ffiis work. 

Next, we will illustrate the use of the randomly permuted 
list in the sampling of the pixels (steps [TT] and [12] of the al- 
gorithm). The motivation for this procedure is that the net- 
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FIG. 8 Histograms of s for the discussion of sampling uniformity. 
All the histograms are from the gene fusion network and a pixel 
centered around r = -0.1, C = 0. 1 (the dimensions of a pixel is 
Ar = 0.013, AC = 0.0096. The error bars represent standard errors. 
Lines are guides for the eyes, (a) shows the histograms with a dif- 
ferent numbers of random edge-pair swappings v,„(i. within the pixel 
before the measurements of quantities, (b) illustrates the location 
of the starting point pixels used in panels (c) and (d). (c) compares 
histograms for swapping processes starting at W, S with the regular 
algorithm, (d) compares the average histogram of walks starting in 
the four peripheral points of (b) with the result of the regular algo- 
rithm. In panels (c) and (d) y|.„d. = 1000. The whole range of the 
histograms is not shown, which is why the areas under the curves 
appear different. 



work structure can depend on the direction from which the 
search arrives to the pixel. In Fig. [8jb) we illustrate the test 
procedure — we sample separate histograms from four starting 
points in the four cardinal directions with respect to the cen- 
ti-al (r, C) = (-0.1, 0.1) pixel. In Fig.[8];c) we see that the his- 
tograms from ffie W and S pixels are different. There appears 
to be two regions of 0(G) contributing to these histograms 
(one with s ^ 0.65, one with s ^ 0.75). Searches starting from 
W seem to arrive at ffie s « 0.75 region more frequently, and 
searches staring at S ends up around s «j 0.65 more frequently. 
The curve of the actual algorithm weighs the two peaks more 
equal. The curves from N and E coincides almost completely 
the curve for the regular algorithm (and are therefore omitted 
for clarity). The impression we get is that the search from one 
direction can induce a bias in the network structure (symbol- 
ically speaking, the graphs have a preference for ending up 
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in a certain region of @{G)). However, from other directions, 
or by the random sampling of pixels (stepfTTT). the bias is re- 
duced. This picture is further strengthened in Fig. [Hid) where 
we show that the average value of the histograms from the 
four starting points are overlapping with the histogram of the 
regular algorithm. 
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